PROCESSING OF MOTION VECTOR HYSTOGRAMS FOR RECOGNIZING 
THE INTERLEAVED OR PROGRESSIVE CHARACTER OF PICTURES 



Field of the Invention 

The present invention relates to methods for 
processing digital video pictures, and, more 
particularly, to a method for detecting properties of a 
sequence of digital pictures. 

Background of the Invention 

The increasing requirements of television 
sets in terms of quality of the displayed images, and 
the increasing requirements of transmitting devices in 
terms of amplitude of the frequency bands used during 
transmission have made the techniques for transmitting 
video sequences very important. Use of digital 
transceiving systems has made possible splitting of a 
frame of a picture into a matrix of numerical values 
representing the intensities of the video components of 
each pixel of a frame. This allows processing of the 
picture by computers. The possibility of treating a 
picture as a matrix of numbers, and as a bitstream, 
offers the possibility of coding pictures to facilitate 
transmission and to improve the quality of displayed 
images . 

Systems for coding video digital sequences 
are based on the recognition of several properties of 



pictures. Among such properties, a particular 
importance is the detection of the character 
progressive or interlaced of the pictures, and the 
detection of the motion of the objects. When these 
properties are known, it is possible to compress the 
picture making most efficient their transmission in 
terms of used frequency band, and most satisfactory in 
terms of their display quality. 

Several techniques of motion estimation and 
detection of the progressive or interlaced content, 
i.e., interlaced progressive detection (IPD) of digital 
video sequences are known. Such techniques are 
disclosed in U.S. Patent No. 5,734,420, and patents WO 
99/20040, EP-A-0917363, EP-A-0944245, EP-A-097 9011 , 
EP99830545.2, and EP98830689 . 0 . These patents are 
incorporated herein by reference in their entirety, and 
are assigned to the assignee of the present invention. 
The methods disclosed in the above mentioned documents 
operate on the bit values representing the video 
components of single pixels. This requires a 
computational complexity that makes them inappropriate 
for low cost applications. 

A frame according to the PAL standard is 
composed of 720 * 288 = 207,360 pixels. Methods 
operating on each pixel are rather burdensome or impose 
simplifications that reduce the quality of the 
displayed picture. Methods of elaborating a bitstream 
of video digital sequences operating on groups of 
pixels (macroblocks) to reduce the computation 
complexity are generally used for low cost 
applications . 

More precisely, there is a need for a method 
of elaborating data of digital video sequences that 
provides the following advantages while operating on 
groups of pixels. These advantages permit recognition 
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of the processed picture as a progressive or interlaced 
picture with a reduced number of calculations. There 
should be a reduction in the amount of calculated 
motion estimations while preserving quality and 
5 efficiency of the compression. The quality of 
displayed pictures are enhanced for the same 
compression efficiency and the same amount of 
calculations performed during the motion estimation. 

Summary of the Invention 

10 In view of the foregoing background, it is 

therefore an object of the present application to 
provide a processing method usable with MPEG standard 
coded video sequences requiring the performance of 
calculations only on motion vectors. Given that motion 

15 vectors can be defined even for a macroblock, instead 

of being defined for each single pixel, an advantageous 
computational simplification may be achieved. 

The method according to the present invention 
allows the detection of the progressive or interlaced 

20 content of a picture for improving the effectiveness of 
the coding of video sequences, and the effectiveness of 
the filtering that is made on the chrominance component 
of pictures input to the coder. The method allows for 
an enhanced precision in the calculation of motion 

25 vectors by virtue of a pre-recognition of the processed 
picture as a progressive or interlaced picture. 

Another object of the present invention is to 
provide a method of processing a bitstream of coded 
data of video sequences of progressive or interlaced 

30 pictures, divisible in a top half-frame and in a bottom 
half-frame, that comprises estimating motion vectors of 
groups of pixels belonging to the top half-frame of the 
current picture in relation to pixels belonging to the 
bottom half-frame of the preceding picture, and 



estimating motion vectors of groups of pixels of the 
bottom half-frame of the current picture in relation 
pixels belonging to the top half-frame of the current 
picture . 

Starting from such estimations, respective 
top and a bottom motion coefficients depending on the 
estimation of the motion vectors of the top and bottom 
half-frame are calculated for each macroblock of the 
top half -frame and bottom half -frame. The current 
picture is recognized as a progressive or an interlaced 
picture as a function of the calculated top and bottom 
motion coefficients. A substantial equality of the 
distributions of values of motion coefficients point to 
an interlaced content by contrast, and a substantial 
inequality of the distribution of values of the 
coefficients point to a progressive content. 

The recognition can be done by calculating a 
pair of shape coefficients representing the 
distributions of the top and bottom motion 
coefficients. The current picture is recognized as 
progressive or as interlaced depending on whether the 
shape coefficients differ by a guantity greater or 
lower than a certain value. 

According to an alternative embodiment, it is 
possible to distinguish progressive pictures from 
interlaced pictures by calculating a coefficient 
representing the stochastic correlation between the 
distribution of the top and bottom motion coefficients. 
In addition, the current picture can be recognized as 
an interlaced picture or as a progressive picture 
depending on whether the calculated coefficient exceeds 
a certain value. 

The recognition is preferably established by 
comparing the top and bottom coefficients with a pre- 
established top threshold and a bottom threshold, 



respectively. The number of motion vectors whose 
motion coefficients are lower than the top threshold of 
the top and the bottom frame of the current picture are 
counted. This produces a pair of respective 
coefficients. The number of motion vectors whose 
motion coefficients are greater than the bottom 
threshold of the top and bottom half-frame of the 
current picture are counted. This produces a second 
pair of respective coefficients. The current picture 
is recognized as a progressive or interlaced picture 
depending on the four coefficients relative to the 
current picture and to preceding pictures. 

The method according to the present invention 
may also comprise the following steps. For each 
picture, a temporary weight value is calculated as a 
function of the result of the recognition of the 
picture as a progressive or interlaced picture carried 
out according to the above mentioned method. A final 
weight value is calculated as a function of the 
temporary weight value relative to the picture and of 
the final weight values relative to preceding pictures. 
A current picture is recognized as a progressive or an 
interlaced picture depending on its temporary weight 
value and on the final weight values relative to 
preceding pictures. 

These methods may be implemented to obtain a 
refined calculation of the motion vectors of a picture 
of a video sequence by a Frame-Prediction technique if 
the current picture is recognized as a progressive 
picture, or by a Field-Prediction technique if the 
current picture is recognized as an interlaced picture. 

Brief Description of the Drawings 

The different aspects and advantages of the 
present invention will result even more evident through 



the description of several embodiments of the invention 
and by referring to the attached drawings, wherein: 

Figure 1 is an example of a video sequence 
taken with a progressive camera according to the prior 
art; 

Figure 2 is an example of a video sequence 
taken with an interlaced camera according to the prior 
art; 

Figure 3 is an example of a possible 
distribution of the motion coefficients of the motion 
vectors of a top half-frame (top field) and of a bottom 
half-frame (bottom field) of a progressive sequence 
according to the prior art; 

Figure 4 is an example of a possible 
distribution of the motion coefficients of motion 
vectors of a top half-frame (top field) and of a bottom 
half-frame (bottom field) of an interlaced sequence 
according to the prior art; 

Figure 5 is a block diagram of a circuit for 
executing an algorithm that calculates comparison 
coefficients according to the prior art; 

Figure 6a is a flowchart of the interlaced 
progressive detection (IPD) procedure according to the 
present invention; 

Figure 6b is a detailed flowchart of the 
functional block Hysteresis illustrated in Figure 6a; 

Figure 6c is a flowchart of the functional 
block Quasi-stationary illustrated in Figure 6a; 

Figure 7 is the scheme of detection of the 
progressive or interlaced content of pictures according 
to the present invention; and 

Figure 8 is a block diagram of a generic two 
step motion estimator using the IPD prcedure according 
to the present invention. 
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Detailed Description of the Preferred Embodiments 

The method according to the present invention 
allows the recognition of pictures of digital video 
sequences as interlaced or progressive pictures 
5 irrespectively of the standards used for coding. The 
method exclusively depends on previously performed 
estimations of motion vectors relative to groups of 
pixels. The method can be exploited to improve the 
calculation of motion vectors in terms of augmented 

10 precision and/or in terms of reduced computational 
complexity. 

The fundamental approach of the present 
invention comprises always carrying out motion 
estimation through a comparison of parameters of 

15 consecutive half-frames to verify whether the pictures 
to code are interlaced or progressive. A frame of a 
picture is divisible in two half-frames (fields), 
commonly called top (or odd) field and bottom (or even) 
field. This is because of the position of the lines 

20 forming the frame. 

In case of a progressive sequence, if the 
motion estimation is carried out on the bottom field 
with reference to the top field belonging to the same 
frame, the modulus of the resulting motion vectors will 

25 tend to zero. This happens because the two fields have 
been sampled by a progressive camera at the same time 
instant (in fact they belong to the same frame), and 
not at an interval of 1/50 (or of 1/60) of a second, as 
in the case of an interlaced PAL or NTSC camera. 

30 Moreover, is very probable that the vertical 

component of motion vectors be between -1 and 1. This 
is because of the vertical difference of a line between 
fields of different parity, while the horizontal 
component is practically zero. It is always possible 

35 that zones with an elevated vertical detail (spatial 



frequencies) deceive the estimator by having it 
estimating a non-existent vertical motion. However, it 
is supposed to have a robust motion estimator not 
affected by this problem. 

In contrast, motion estimation between the 
top field of frame k and the previous bottom field of 
the frame k-1 will detect an effective motion, not only 
in an interlaced sequence, but also in an eventual 
progressive sequence. The two fields are sampled at 
different instants, and in fact they belong to 
different frames. 

Figures 1 and 2 illustrate this concept. In 
particular, Figure 1 shows the motion of a ball taken 
by a progressive camera, and Figure 2 shows the same 
scene taken by an interlaced camera. As can be seen, 
while from a field to the successive of the sequence of 
Figure 2, the motion vectors MV are always different 
from zero. This is not true for the motion vectors of 
the sequence of Figure 1. In fact, the motion vectors 
MV from a bottom field to a top field of the same frame 
k or k+1 are always zero, while the motion vectors MV 
from a top field belonging to the frame k+1 to the 
bottom field belonging to the frame k are always 
different from zero. The critera of the invention for 
distinguishing in a relatively straightforward manner 
interlaced pictures from progressive pictures is based 
on such a peculiarity. 

The variable MVtop x , y (k) is indicated as the 
motion vectors of the k-th top half-frame with respect 
to the (k-1) -th bottom half-frame, and MVbot x , y (k) is 
indicated as the motion vectors of the k-th bottom 
half-frame with respect to the k-th top half-frame, 
where k is the frame number. Such motion vectors MV 
=(V x ,V y ) may be relative to single pixels or to groups 



of pixels (macroblocks) of the k-th picture, belonging 
to the top half-frame or the bottom half-frame, 
respectively. 

To implement the method according to the 
present invention, it is necessary to obtain by 
estimating, even coarsely, the motion vectors MVtop x , y (k) 
and MVbot x<y (k), as many respective motion coefficients 
Vtop x , y (k) and Vbot X/Y (k), and to recognize the current 
picture as a progressive or interlaced picture 
depending on such coefficients. 

A preferred way of calculating the motion 
coefficients Vtop x , y (k) and Vbot X(Y (k) is to calculate for 
each motion vector MV the sum of the absolute value of 
its components, i.e., to make 

VTOP x , Y (K) = |VJ + |V y | 

This is done by considering the motion vectors 
MVtop x , y (k) of the top half-frame of the k-th picture, 
and to make 

VBOT x , Y (K) = |Vj + |V y | 

This is done by considering the motion vectors 
MVbot X(Y (k) of the bottom half-frame of the k-th picture. 
The above way of calculating the motion coefficients is 
only one of several possible ones. For example, it 
would be satisfactory to calculate the motion 
coefficients as the square root of the sum of squares 
of V x and V y , as well as in other ways, as will appear 
evident to one skilled in the art. 

By way of an example, the case where the 
motion coefficients Vtop x , y (k) and Vbot x , y (k) are calculated 
as the sum of the absolute values of the components of 
the motion vector MV will be described below. The 
method may also be implemented by calculating motion 
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coefficients in a different manner. 

It is possible to distinguish an interlaced 
picture from a progressive picture by examining the 
distribution of the motion coefficients of its top 
half-frame and of its bottom half-frame. It has been 
found that progressive pictures are characterized by 
notable differences in the distribution of top and 
bottom coefficients, while interlaced pictures are 
characterized by distinctly more uniform distributions 
of motion coefficients. 

This concept is clearly shown in Figures 3 
and 4 wherein two typical histograms of the respective 
distributions of motion coefficients of a top half- 
frame (top field) and of a bottom half-frame (bottom 
field) , relative to progressive sequences and 
interlaced sequences, respectively, are depicted. 

In particular, in Figure 3 the difference 
between the two distributions is evident. The 
distribution of coefficients of the bottom field is 
characterized by a greater concentration (vertical 
axis) near the value 1 (horizontal axis), while the 
distribution of coefficients of the top field has a 
greater uniformity. On the contrary in Figure 4, the 
two histograms are almost identical. 

Upon testing many different video sequences, 
calculating the respective distributions of motion 
coefficients, obtained with different motion 
estimators, it has been seen that the distributions of 
motion coefficients of bottom fields belonging to 
frames of movie sub-sequences (film-mode) show a 
distinct concentration about the maximum value, which 
is near to 1. This is because of the vertical 
component of vectors. On the contrary in top fields 
such a maximum is distinctly lower, i.e., the 
dispersion is higher than in bottom fields. Also, the 



distributions of motion coefficients of top fields and 
of bottom fields belonging to interlaced sequences have 
a substantially similar shape, and the dispersion of 
the distributions is greater the more motion is 
present . 

The same significant shapes are obtained even 
for a single frame in addition to whole sequence. An 
interlaced picture is characterized in that the 
distribution of top motion coefficients is almost 
identical to that of bottom motion coefficients. This 
is while a progressive picture is characterized by a 
notable difference between distributions of top and 
bottom motion coefficients, and by a relatively 
stronger clustering near the maximum value of the 
bottom motion coefficients than of the top motion 
coefficients . 

The detection of one of the above mentioned 
characteristics is exploited for recognizing the 
progressive or the interlaced content of a picture. 
The stochastic variable theories give several methods 
for recognizing one of the above mentioned properties. 
Such methods may include calculating a pair of shape 
coefficients representing the distribution of the top 
and of the bottom motion coefficients, and in verifying 
whether between the two shape coefficients there is a 
substantial difference or not. In the positive case, 
the two distributions are recognized as similar, 
otherwise as different from each other. 

Alternatively it is possible to calculate a 
coefficient representative of the correlation of the 
two distributions, and to verify whether such a 
correlation coefficient exceeds a certain value. In 
the positive case the picture is interlaced, otherwise 
the picture is progressive. 

Yet another method may include using as a 
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shape coefficient the variance with respect to the 
maximum value of the two distributions of motion 
coefficients. Given that for a progressive picture the 
values of motion coefficients of the bottom field are 
5 more clustered near their maximum than the values of 
motion coefficients of a top field, the variance for 
the top field will be greater than the variance for the 
bottom field. Therefore, if the variance of the motion 
coefficients of the top field exceeds by a certain 

10 percentage the variance of the motion coefficients of 

the bottom field, the picture is progressive, otherwise 
the picture is interlaced. 

A particularly interesting application of the 
invention for performing a pre-test of the type of 

15 pictures includes recognizing a picture as progressive 
by way of the motion coefficients. Sometimes in video 
sequences there are pictures called quasi-stationary, 
whose motion is limited to relatively small portions of 
the frame. A typical quasi-stationary picture is a 

20 picture representing a TV news speaker, wherein the 
motion is substantially limited to the face of the 
speaker while the remaining portion of the picture 
remains substantially static. 

In such a situation there is a relatively 

25 small difference between a progressive and an 

interlaced picture. Therefore, without introducing 
relevant errors it is possible to classify the examined 
picture as a matter of convenience. Because a most 
important application is for video sequences to be 

30 coded according to the MPEG standard, the quasi- 
stationary pictures may be considered progressive 
because such a choice allows a considerable reduction 
in the number of calculations needed for the MPEG 
coding. 

35 The recognition of pictures as quasi- 
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stationary pictures is carried out by producing two sum 
coefficients, respectively top SVtop and bottom SVbot, 
as the sum of all top and bottom motion coefficients, 
respectively. This is represented by the signal B of 
5 the quasi-stationary block of the schemes of Figure 6a 
and 6c. If both sum coefficients SVtop and SVbot are 
lower than respective pre-established positive numbers 
Tl and T2, that is: 

SVtop < Tl and SVbot < T2 (1) 

10 then the examined picture will be recognized as a 
quasi-stationary picture. 

Given the great simplification resulting from 
the possibility of treating pictures as quasi- 
stationary pictures, it is possible to conveniently use 

15 the above described test before any subsequent test for 
detecting eventually the progressive or interlaced 
content of a picture. A preferred way of 
discriminating progressive pictures from interlaced 
pictures includes defining two pre-established 

20 thresholds, respectively bottom THR1 and top THR2, and 
comparing the top motion coefficients Vtop x , y (k) and 
bottom motion coefficients Vbot X/Y (k) with these 
thresholds, indicating with the following: 

NItop(k) = number of all MVtop X/Y (k) 
25 such that Vtop x , y (k) <THR1 

N2top(k) = number of all MVtop XiY (k) 

such that Vtop x , y (k) >THR2 (2) 

NIbot(k) = number of all MVbot x , y (k) 
such that Vbot x , y (k) <THR1 



30 



N2BOT (K) 



= number of all MVbot x , y (k) 
such that Vbot x , y (k) >THR2 



-14- 

An example of a possible functional block 
diagram for calculating the parameter NItop(k) of a top 
field is shown in Figure 5. The architecture for 
calculating the other three parameters is the same. 

Referring to the histograms of Figures 3 and 
4, NItop(k) and NIbot(k) give an indication of the 
maximum value of the distributions (if THRl is greater 
than such a maximum value), while N2top(k) and N2bot (k) 
indicate their dispersion (width of the dome-shaped 
curve enveloping the histograms) . 

The following ratios are defined: 

R1(k)= NItop(k) / NIbot(k-I) 
R2(k)= NIbot(k) / NItop(k) (3) 
R3(k)= N2top(k) / N2bot(k-1) 
R4(k)= N2bot(k) / N2top(k) 

Calling y, 5, £ and n four pre-established 
numbers, if all the following tests verify: 

Rl (k) < y 

R2 (k) > 5 (4) 
R3(K) > S 
R4 (K) < n 

then the k-th frame is recognized as progressive, 
otherwise it is recognized as interlaced. The choice 
of using all four parameters instead of using only 
R2 (k) and R4 (k) , for example, is due to the fact that 
because a decision must be taken for each frame, it is 
preferable to use all available information to minimize 
uncertainty. 

To prevent errors, it is also possible to 
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apply the following conditions: 



R5 (K) = R2 (K) / Rl (K) > 9 (5) 

R6 (K) = R3 (k) / R4 (k) > l 

The variables 9 and i are two pre-established positive 
5 numbers. The latter conditions impose a more 
distinguishable difference between the two 
distributions for classifying a picture as a 
progressive one. 

An advantage of the above described 

10 procedures includes the possibility of operating on 

motion vectors established for macroblocks rather than 
exclusively for single pixels. This permits an evident 
simplification of calculations that makes the method of 
the invention particularly appropriate even in low cost 

15 applications. 

To avoid oscillations in the results provided 
by the procedure of the invention for detecting the 
interlaced or progressive content of the picture, a 
simple hysteresis mechanism that considers the 

20 decisions taken in preceding frames may be introduced. 

The oscillations could be caused by short portions of a 
video sequence wherein the motion is such to make the 
discrimination very difficult 

A hysteresis mechanism may be realized by 

25 associating to each picture a temporary weight value 
P (k) depending on the recognition of the picture as a 
progressive or an interlaced picture, according to one 
of the above described methods of the invention. For 
each processed picture a final weight value D (k) is 

30 calculated as a function of its temporary weight value 
P(k), and of the final weight values D(i) relative to 
one or more preceding pictures. The current picture is 
recognized as a progressive or interlaced picture based 
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on its final weight value. 

An example of how such a hysteresis algorithm 
may be realized is shown in the flowchart of Figure 6a. 
First, the number of previous pictures to be considered 
5 must be decided by attributing a value to the parameter 
NF, for example NF=3 . Afterwards, the system applies 
an arbitrary technique for estimating the motion 
vectors and calculates the motion coefficients Vtop X(Y (k) 
and Vbot X(Y (k) . Starting from the values of these 

10 coefficients, the block quasi -stationary verifies whether 
the picture is quasi-stationary or not, according to 
the above described algorithm. 

In the positive case, the picture is 
classified as progressive, its final weight is 

15 established D(k)= go and a successive picture is 

processed. In the negative case, the system calculates 
the ratios Rl (k) , R2 (k) , R3 (k) , R4 (k) , R5 (k) and R6 (k) 
according to equations (3) and (5) . If these ratios 
satisfy the inequalities (4) and (5), the block 

20 Hysteresis, described in more in detail in Figure 6c 
associates a temporary weight P(k) to the picture. 
Such a temporary weight is equal to a number p if the 
inequalities (4) and (5) are satisfied, otherwise the 
temporary weight is equal to a. 

25 When the temporary weight is known, the block 

Hysteresis recognizes the current frame as a progressive 
frame if the sum of weights associated to the last two 
frames and to the current frame is greater than or 
equal to a certain number m, i.e.: 

30 P(K) + D(k-1) + D(k-2) > u 

and the frame gets a final weight D (k) = (3. In the 
contrary case, the current frame gets a final weight 
D(k)= a, and is classified as interlaced. 



If equations (4) and (5) are not verified 
(input N of the Hysteresis block in Figures 6a and 6b), 
the current frame k is temporarily classified as 
interlaced and assumes the temporary weight P (k) = a. If 
the complementary condition is verified: 

P(K) + D(K-l) + D(K-2) < u 

The final choice is an interlaced frame with a final 
weight D(k)= a. In the contrary case, the frame is 
classified as progressive and assumes the final weight 
D(k)= co. According to a preferred embodiment of the 
invention, the weight a, (J, u and co are equal to 0, 1, 
1.5, 0.5, but these values may be changed depending on 
the needs, even during the same picture sequence. 

With such a hysteresis procedure, the 
effective choice made in previous frames using the 
weights D(K-l) and D(k-2), and the indication given by 
the various decision parameters Rl (k) , R2 (k) , R3 (k) , 
R4 (k) are considered, without giving a priority to any 
one of the two possible choices. 

A block diagram of an embodiment of the 
method of the invention is depicted in Figure 7. 
Starting from an estimation of the motion vectors, it 
is verified if the current picture is quasi-stationary. 
Then the algorithm of the invention (IPD) is executed, 
and finally a functional block implementing the 
hysteresis procedure produces a flag indicating whether 
the picture is progressive or interlaced. 

Using the hysteresis procedure, an abrupt 
change from the interlaced to the progressive scanning 
mode is prevented, thus avoiding short lasting 
disturbances (noise) that may affect the decision. The 
method of the invention allows for the calculation of 
the motion vectors in a refined manner, by adaptively 
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choosing the calculation algorithm that implies the 
lowest number of calculations. 

Given the importance assumed by the MPEG2 
standard in treating sequences of digital pictures, in 
the following text reference will be made to a MPEG2 
system, although the same considerations will be 
similarly applicable, even in systems based on 
different standards. 

In the MPEG2 standard, as in other standards, 
at least two different techniques of motion estimation 
and consequent temporal prediction are possible. One 
technique is Frame Prediction, wherein the current 
picture is divided in macroblocks of frames (16x16 
pixels) , and for each of them the predictor in the 
preceding frame is found with an arbitrary method for 
motion estimation. 

Another technique is Field Prediction, 
wherein each macroblock frame is divided in its two 
component fields (16x8 pixels) . One on the even half- 
frame and one on the odd half-frame, and for each of 
them the predictor on the field of the same parity 
belonging to the temporarily preceding picture is 
found. 

The best coding is obtained by testing all 
possible combinations of predictors of a macroblock. 
These include frame, field top k on field top k-1, 
field top k on field bottom k-1, field bottom k on 
field top k-1, field bottom k on field bottom k-1. In 
all cases, the prediction (and the motion estimation) 
can be Forward or Backward. Therefore, a great number 
of matching errors need to be calculated for each 
macroblock. 

This becomes burdensome for low cost 
applications. Interlaced sequences with little motion 
may be coded as progressive, without any relevant loss 
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of quality, thus avoiding a burdening Field Prediction. 
Similarly, an interlaced sequence with considerable 
motion would not have any advantage of being predicted 
as a Frame, and they could satisfactorily require only 
5 a Field Prediction estimation. 

It is evident that a large computational 
simplification (from about 30% to about 50%) is 
obtainable by carrying out only a Field or a Frame 
estimation. It would be necessary just to have a 

10 mechanism for telling the motion estimator when to 

operate in Field mode and in the Frame mode. This is 
provided by the IPD algorithm of the present invention. 
The only requirement is that the motion estimation be 
carried out on consecutive fields in one or more steps. 

15 The first step, carried out by the block 

Coarse search depicted in Figure 8, is necessary to 

calculate coarse estimations of motion vectors from 
successive frames. The second step, carried out by the 
block Fine search, is necessary to calculate a finer 

20 estimation of the motion vectors needed to the 

effective coding. At the end of the first step it is 
possible to carry out the IPD procedure of the 
invention to implement a fine motion estimation in the 
second step. 

25 Finally, the IPD method of the invention, 

when associated to a method for motion estimation, 
provides for alternative advantages as compared to the 
use of the motion estimator without any IPD modulus of 
the invention. A better quality of the predicted 

30 pictures for the same maximum number of calculated 
matchings per macroblock is provided because it is 
possible to increase the number of Frame only or Field 
only predictors (up to certain maximum number) by 
choosing the prediction mode depending on the 

35 indication provided by the IPD modulus. A lower number 
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of matchings per macroblock is obtained for the same 
quality of predicted pictures because either a Field 
prediction or a Frame prediction is carried out. This 
depends on the indications provided by the IPD modulus, 
5 thus resulting is a savings from 30% to 50% of the 

number of matchings depending on the motion estimation 
mode . 



